Search CORE

4 research outputs found

Putting An End to End-to-End: Gradient-Isolated Learning of Representations

Author: Löwe S.
O'Connor P.
Veeling B.S.
Publication venue: Neural Information Processing Systems Foundation
Publication date: 01/01/2020
Field of study

International Migration, Integration and Social Cohesion online publications

Putting An End to End-to-End: Gradient-Isolated Learning of Representations

Author: Löwe S.
O'Connor P.
Veeling B.S.
Publication venue: Neural Information Processing Systems Foundation
Publication date: 01/01/2020
Field of study

We propose a novel deep learning method for local self-supervised representation learning that does not require labels nor end-to-end backpropagation but exploits the natural order in data instead. Inspired by the observation that biological neural networks appear to learn without backpropagating a global error signal, we split a deep neural network into a stack of gradient-isolated modules. Each module is trained to maximally preserve the information of its inputs using the InfoNCE bound from Oord et al. [2018]. Despite this greedy training, we demonstrate that each module improves upon the output of its predecessor, and that the representations created by the top module yield highly competitive results on downstream classification tasks in the audio and visual domain. The proposal enables optimizing modules asynchronously, allowing large-scale distributed training of very deep neural networks on unlabelled datasets.Comment: Honorable Mention for Outstanding New Directions Paper Award at NeurIPS 201

arXiv.org e-Print Archive

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Predictive Uncertainty through Quantization

Author: van den Berg R.
Veeling B.S.
Welling M.
Publication venue: University of Amsterdam
Publication date: 12/10/2018
Field of study

UvA-DARE

Predictive Uncertainty through Quantization

Author: van den Berg R.
Veeling B.S.
Welling M.
Publication venue: University of Amsterdam
Publication date: 12/10/2018
Field of study

High-risk domains require reliable confidence estimates from predictive models. Deep latent variable models provide these, but suffer from the rigid variational distributions used for tractable inference, which err on the side of overconfidence. We propose Stochastic Quantized Activation Distributions (SQUAD), which imposes a flexible yet tractable distribution over discretized latent variables. The proposed method is scalable, self-normalizing and sample efficient. We demonstrate that the model fully utilizes the flexible distribution, learns interesting non-linearities, and provides predictive uncertainty of competitive quality